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1. Introduction 

Several large scale interferometric instruments are being commissioned HI |2| or have 
started science runs OHU, aiming at the detection of gravitational waves (GW) emitted by 
astrophysical sources. The interferometers are sensible in a relatively wide frequency band, 
which makes it possible to resolve the structure of short bursts of GW, like those emitted 
during the Type II supernova explosions, or the merger phase in the binary black-holes 
coalescence (see [5| and references therein). 

However it has been frequently argued OB that the knowledge of the burst waveforms is 
rather poor, and therefore the classical Wiener filtering may be not applicable: in this context, 
it is advisable to develop methods which do not rely on the theoretical signal waveform 
(see [8 1 for a recent review). One of us (A.V.) has proposed in one such method, which is 
claimed to be "optimal" under the assumptions that only the duration of the burst is known, 
and that the detector noise is Gaussian, but not necessarily white. 

In the method has been derived from those assumptions, and an algorithm has been 
defined that we will call "generalized delta filtering" (GDF), because it reduces itself to a 8- 
filtering when its "window size" parameter is chosen to be 1 . In that paper no test was made 
to assess the actual detection performance of the GDF. 

In the present work we have made such a test, using simulated waveforms ifTOl and the 
expected Virgo spectral noise density lfT2l . 

2. The method in brief 

Referring to [ 9 1 for details about the method and its derivation, we just recall that in the single 
detector case it amounts to apply a few simple analysis steps: 

(i) a 8-filtering step, also called double whitening ifPTl . which can be implemented in the 
frequency domain dividing the Fourier transform (FT) of the signal by the estimated 
spectral noise density; 

(ii) estimate the Discrete Karhounen-Loeve basis for a window of fixed size N of the 8- 
filtered data; we recall that the DKL basis is a set of N vectors which "diagonalize" the 
input noise, in the sense that coefficients Ck = x ■ \\f k of the decomposition of data x over 
the basis are statistically independent, that is E [c^q] lfl3l : 

(iii) slide the window over the available 8-filtered data and compute the statistic 

L =I^(^-x) 2 (1) 

where and \\f k are the eigenvalues and eigenvectors of the DKL basis; 

(iv) select candidates by comparing the data against a threshold r|. 

The only parameter of the algorithm is the window size N, and possibly the stride used 
in sliding the window over the 8-filtered data. 



Testing the performance of a blind burst statistic 



3 




0,1 0,2 0,3 0,4 



le-19 - 
le-20 - 
S le-21 

le-24 - 

le-25 ' 1 1 1 1 1 1 1 1 

10 100 1000 10000 

f(Hz) 

Figure 1. In the lower part, the expected spectral noise density of the Virgo detector is shown, 
superimposed to an example of spectrum obtained on simulated data. In the upper part, an 
example of noisy output is plotted in the time domain. 




3. The simulation 

3.1. Input noise 

We have used "standard" Virgo noise [12|, as modeled on the basis of the expected noise 
sources [ITU, including the thermal and shot noise in the frequency band [10, 10000] Hz. 

To generate noisy data, we started from white Gaussian noise, "colored" on the basis 
of the Virgo sensitivity curve. We show in Figure [T] an example of the simulated noise 
obtained using this procedure. Also the resulting spectrum is shown, superimposed to the 
"theoretical" spectrum. It is worth noting that the simulated data include the effect of the 
narrow resonances corresponding to the suspension "violin modes": it is yet not completely 
clear if these resonances will be filtered out before applying the detection algorithms, and we 
have chosen not to remove them. 

3.2. Simulated signals 

As prototype GW signals, we have used waveforms resulting from simulations of the core- 
collapse of Type II supernovae made by Zwerger and Miiller 1 10 1 . These waveforms are 
labeled by the values of the parameters A, 5, G which enter in the poly tropic equation of state 
of the nuclear matter. 

Our method is not optimal in the Wiener sense, and therefore it is crucial to set the scale 
of the signals in the simulations. We have chosen to normalize all the signals to the same 
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Figure 2. In anti-clockwise sense: simulated signal+noise, signal only, output of a 8-filtering 
step, output of a GDF analysis using an N = 4 window. 



intrinsic signal to noise ratio, defined as usual by: 



SNR 



intrinsic 



df 



(2) 



lo S n (f) 

where S n is the one-sided spectral density of the noise, and s is the FT of the signal considered. 



In our simulations we have set SNR; 



intrinsic 



5 to compare with the results in [ 8 1 . 



3.3. Examples of filtering chain 

It is instructive to have a first exploratory look at the results of a GDF analysis performed as 
in Sectional In Figure|2]we show what happens with an N = 4 window, choosing a Zwerger- 
Miiller (ZM) signal with a rather short duration, and very much like an instantaneous burst. 
In this case, the outputs of both the 8-filtering and the GDF analysis put clearly in evidence 
the event occurrence. In Figure[3]we choose a signal having a more complicated time-domain 
evolution: in this case, the 8-filtering is unable to capture the event, but a GDF analysis using 
an N = 32 window appears instead to put it in evidence, although other false alarms appear in 
the same time window, whose significance remains to be assessed. 

These examples are just indications that the method works: for a quantitative 
understanding we need to estimate the receiver operating characteristics (ROC). 
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Figure 3. In anti-clockwise sense: simulated signal+noise, signal only, output of a 8-filtering 
step, output of a GDF analysis using an N = 32 window. 



4. The ROC estimation 



It is worth recalling that the statistical properties of the GDF detector can be assessed 
analytically, because the statistic in Equation[T]is just a % 2 with N degrees of freedom. 
In presence of a signal, the statistic is a non-central % 2 , and its distribution is J9| 

where qF\ is the hypergeometric function. The distribution J(L|SNR) depends only on the 
SNR of the signal, as seen by the GDF method. Note that this SNR has nothing to do with the 
SNR mtrms j c , and it is defined as customary by the ratio 

E[L\Hi]-E[I\H Q \ 



SNR 



(4) 



{L-E[L\H }) 2 \H 



where Hq and H\ are the hypotheses of absence or presence of a signal, respectively. Using 
this definition it is immediate to show that 



SNR 



(5) 



for the GDF statistic. 

In order to estimate false alarms and detection probabilities, we need therefore first to 
compute how much SNR is collected by the GDF detector(s), for a fixed SNR mtrms i c - 
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Figure 4. The distribution of SNR for different values of the N parameter of the GDF 
algorithm, based on ZM input signals normalized to have SNR mtrms j c = 5 



4.1. SNR distribution of the ZM signals 

We have considered the 78 signals in the ZM family ifTOl . normalized w.r.t. the theoretical 
Virgo noise to have SNR mtrms j c = 5, and we have computed for each of them the SNR 
resulting from a GDF analysis, using different values of the window size N. We stress that 
there is no point in comparing the values of SNR for different detection methods or different 
window sizes N: only the comparison of false alarm and detection probabilities is sensible. 
We show in Figure HJthe resulting SNR distributions. 

4.2. False alarm and detection probabilities 

Starting from the distribution in Equation|3]it is immediate to derive the false alarm probability 

r°° rP-^ 

P FA (J\)= / d{L\Q)dL=^2± (6) 

while the detection probability Pdet (Tl|SNR) can be evaluated by a numerical integral. 

We stress that these Pdet, Pea are "per bin", namely they do not take into account the 
auto-correlation of the filter outputs, which would require to resample the output or to select, 
among the outputs above threshold, only the maxima over a fixed time window, whose length 
would be another parameter of the algorithm. 

For a given signal, we have varied the threshold r\ to obtain Pdet as a function of Pp&, 
and for each value of r\ we have averaged the probabilities over the different ZM waveforms; 
the resulting plot of Pdet against Pfa constitutes the average ROC curve, that we show in 



Testing the performance of a blind burst statistic 



X) 
a 
X) 

s 

&h 0,5 
c 
_o 

o 
i> 

Q 




Q ft— -I ^ UNim. i TT1ITT1 1 1 1 1 1 

le-07 le-06 le-05 le-04 le-03 le-02 le-01 le+00 



False Alarm Probability 



Figure 5. The ROC curves obtained averaging over the 78 waveforms of the ZM family (see 
text), for different window sizes N 



Figure |5] for different values of /V. The similarity of the plots may lead to argue that the 
window size is not very important. Actually this is just an effect of the averaging, as indicated 
by the examples in Figures|2]and|31 which show that one should better match the signal length 
and the window size /V. This is confirmed in Figure |6j where we plot the best and worst 
ROC curve obtained over the 78 ZM signals, for two different values (4 and 32) of the N 
parameter. The large dispersion of the results is a consequence of the large differences of the 
SNR recovered by the GDF filters over the various waveforms, as displayed in Figure |4j 

5. Conclusions 

In this paper we have started to assess the detection capabilities of the "generalized delta 
filtering" introduced by one of us in [9 1, by computing the receiver operating characteristics 
in the case of Virgo simulated noise, using signals from the Zwerger-Miiller family. 

The ROC curves we obtain are generally inferior to those obtained by some other burst 
detection methods JSJ; however we should mention that in [8| a white Gaussian noise was 
used, a difference with our computations whose relevance we are unable to assess. 

The results show a wide variation over the N parameter, and suggest that further 
optimization is possible: one should define a bank of filters with different N values and 
understand how to combine the various outputs and how to select candidates. As discussed in 
Section l4~2l one should take into account the fact that the filter outputs are correlated, and that 
one may strongly decrease the false alarm probability by re- sampling the output at a lower 
rate, while probably not hampering too much the detection probability. 




Figure 6. The best and worst cases of the ROC curves, obtained over the different ZM 
waveforms, for two different values of the N parameter. Note that the horizontal scale is 
different from the one used in Figure|5] 



These studies are beyond the scope of the present work and will be the subject of a 
forthcoming paper. 
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